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Buffer- Aided Relaying with Adaptive Link Selection 



Nikola Zlatanov, Robert Schober, and Petar Popovski 

Abstract 

In this paper, we consider a simple network consisting of a source, a half-duplex decode-and-forward relay, 
■ and a destination. We propose a new relaying protocol employing adaptive link selection, i.e., in any given time 

slot, based on the channel state information of the source-relay and the relay-destination link a decision is made 
whether the source or the relay transmits. In order to avoid data loss at the relay, adaptive link selection requires 
. the relay to be equipped with a buffer such that data can be queued until the relay-destination link is selected for 

transmission. We study both delay constrained and delay unconstrained transmission. For the delay unconstrained 
1 case, we characterize the optimal link selection policy, derive the corresponding throughput, and develop an optimal 

power allocation scheme. For the delay constrained case, we propose to starve the buffer of the relay by choosing the 
decision threshold of the link selection policy smaller than the optimal one and derive a corresponding upper bound 
on the average delay. Furthermore, we propose a modified link selection protocol which avoids buffer overflow 
by limiting the queue size. Our analytical and numerical results show that buffer-aided relaying with adaptive link 
selection achieves significant throughput gains compared to conventional relaying protocols with and without buffers 
0^ . where the relay employs a fixed schedule for reception and transmission. 

CO 

in. 

rs j. I. Introduction 

o: 

CNJ . The classical three-node relay channel was originally considered by van der Meulen [1]. Cover and El 
; Gamal [2] investigated the capacity of a memoryless relay channel consisting of a source, a destination, and 
^ ■ a single full-duplex relay. Recently, renewed interest in relay-assisted communication was sparked by [3] 
c3 | and [4]. Since then, the simple three-node system has become a building block for more sophisticated relay 
networks and a host of cooperative communication techniques have been proposed [5]-[12]. The capacity 
of a three-node relay channel comprised of a source, a destination, and a single half-duplex decode-and- 
forward (DF) relay was investigated in [5]. Under the assumption of full channel state information (CSI) 
at every node, it was shown in [5] that the capacity of a three-node DF network without a direct source- 
destination link is given by the minimum of the source-relay and the relay-destination link capacities for 
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the case when the relay employs a fixed schedule for reception and transmission. In [5]-[12] and most 
of the existing literature on half-duplex relaying, it is assumed that the relays receive a packet from the 
source in one time slot and forward it to the destination in the next time slot. Relay protocols operating 
under this restriction are referred to as "conventional" relay protocols in the following. In this paper, we 
abandon this paradigm and give relays the freedom to decide in which time slot to receive and in which 
time slot to transmit. This new approach requires the relays to have buffers. Relays with buffers were also 
considered in [13] and [14]. In [13], the buffer at the relay was used to enable the relay to receive for 
a fixed number of time slots before retransmitting the received information. In [14], relay selection was 
considered and buffers enable the selection of the relay with the best source-relay channel for reception 
and the relay with the best relay-destination channel for transmission. However, similar to conventional 
relay selection without buffers [15], the source transmits in every other time slot. Thus, both [13] and [14] 
do not fully exploit the flexibility offered by relays with buffers since the schedule of when the source 
transmits and when a relay transmits is a priori fixed. 

In this paper, we consider buffer-aided relaying with adaptive link selection, where in any given time slot 
based on the CSI of the source-relay and the relay-destination link a decision is made whether the source or 
the relay transmits. We consider the cases of delay constrained and delay unconstrained transmission. For 
the delay unconstrained case, we optimize the link selection protocol and the power allocated to the source 
and the relay. Interestingly, the optimal link selection policy requires only knowledge of the instantaneous 
CSI of the considered time slot and the statistical CSI of the involved links. However, the instantaneous 
CSI of past and future time slots and the state of the relay's buffer are not required for optimal link 
selection, which facilitates the implementation of the optimal policy. For the delay constrained case, we 
propose two alternative link selection protocols and provide an upper bound for the average delay of one 
of them. Our analytical and simulation results show, in good agreement, that buffer-aided relaying with 
adaptive link selection can achieve large performance gains compared to conventional relaying with or 
without buffer [13], as long as a certain delay can be tolerated. 

The remainder of the paper is organized as follows. In Section [TH the considered system and channel 
models are presented. The proposed link selection protocol for buffer-aided relaying is introduced and 
optimized in Section [Till and optimal power allocation for source and relay is discussed in Section |IVJ In 
Section [Vl we propose two protocols for buffer- aided relaying with delay constraints. Numerical results 
are presented in Section [VT] and some conclusions are drawn in Section IVIIl 
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II. System Model 

We consider a three-node communication system comprising a source S, a half-duplex relay 71, and a 
destination V, cf. Fig. [Q The source can communicate with the destination only through the relay, i.e., 
there is no direct S-V link. The source sends packets to the relay, which decodes these packets, possibly 
stores them in its buffer, and eventually sends them to the destination. Throughout this paper, we assume 
that the source has always data to transmit. 

A. Channel Model 

We assume that time is divided into slots of equal lengths. In the ith time slot, the transmit powers of 
source and relay are denoted by Ps(i) and Pr(i), respectively, and the instantaneous (squared) channel 
gains of the S-1Z and TZ-V links are denoted by hs(i) and hn(i), respectively. hg{i) and hn(i) are 
modeled as mutually independent, non-negative, stationary, and ergodic random processes with expected 
values E{hs(i)} — &s an d E{hji(i)} — CIr, where E{-} denotes expectation. We assume that the channel 
gains are constant during one time slot but change from one time slot to the next due to e.g. the mobility 
of the involved nodes and/or frequency hopping. 

The instantaneous link signal-to-noise ratios (SNRs) of the S-1Z and TZ-V channels in the zth time 
slot are given by s(i) = 7s(z')/is(z) and r{i) = 7/?(i)/iR(i)> respectively. Here, 7s(i) = Ps(i)/crl s and 
7r(z) = PR(i)/cr^ R denote the transmit SNRs of the source and the relay, respectively, and a^ s and a^ R are 
the variances of the additive white Gaussian noise (AWGN) at the relay and the destination, respectively. 
The average link SNRs are denoted by £1$ = E{s(i)} and VIr = E{r(i)}. 

B. Link Adaptive Transmission Protocol 

In the proposed link adaptive transmission protocol, one of the nodes of the network is responsible for 
deciding whether the source or the relay should transmit in a given time slot i. This node is referred to 
as the central node in the following. The central node broadcasts its decision to the other nodes before 
transmission in time slot i begins. If they are selected for transmission, the source and the relay adapt 
their transmission rates to the capacity of the respective link and transmit codewords spanning one time 
slot. We assume that source and relay employ capacity-achieving codes. For both link selection and rate 
adaptation, the nodes require CSI knowledge as will be detailed in the following. 

CSI requirements: The central node requires knowledge of the instantaneous channel gains hs(i) and 
1ir{i). In addition, regardless of which node is the central node, if the S-1Z link is selected, the source and 
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the relay require knowledge of hs(i) for rate adaptation and decoding, respectively. On the other hand, 
if the TZ-V link is selected, the relay and the destination require knowledge of }ir(i) for rate adaptation 
and decoding, respectively. Nodes can obtain the instantaneous channel gains through estimation based on 
pilot symbols emitted by other nodes and/or CSI feedback from other nodes. Furthermore, we assume that 
the central node knows the noise variances a^ s and a\ R , and regardless of which node is the central node, 
source, relay, and destination know o% , (<7^ , <r£ ), and a 2 nR , respectively. Also, if the transmit power 
is a priori fixed, i.e., Ps(i) = Ps and Pr(i) = Pr, Vi, the central node requires knowledge of P$ and 
Pr, and source, relay, and destination require knowledge of Ps, (Ps,Pr), and P R , respectively. If power 
allocation is employed, Ps(i) and -Pr(«) are computed by the nodes based on their respective knowledge 
of the instantaneous channel gains and statistical CSI knowledge (cf. Section [TV] (1281) . d29l)). 

Which node serves as the central node depends on the network architecture. For example, in the downlink 
of a cellular network, the source (base station) can serve as the central node as it typically acquires the 
full CSI of all links anyways and can afford the complexity of performing adaptive link selection and 
power allocation. On the other hand, if the relay serves as the central node, source and destination have 
to acquire only the CSI of the S-1Z and 1Z-V links, respectively. 

For convenience we normalize the number of bits transmitted in one time slot by the number of symbols 
per time slot. Thus, throughout the remainder of this paper, the number of bits refers to the number of 
bits divided by the number of symbols in a codeword. In the following, we discuss the network dynamics 
when source and relay transmit, respectively. 

Source transmits: If the source is selected for transmission in time slot i, it transmits with rate 



Hence, the relay receives Ssiz(i) data bits from the source and appends them to the queue in its buffer. 
The (normalized) number of bits in the buffer of the relay at the end of time slot i is denoted by Q(i) 
and given by 



Relay transmits: If the relay transmits in time slot i, the number of bits transmitted by the relay is 
given by 



S S n(i) = log 2 (l + s(z)). 



(1) 



Q(i) = Q{i-l) + S sn (i). 



(2) 



Rnv(i) = min{log 2 (l + r(i)), Q(i - 1)} 



(3) 
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where we take into account that the maximal number of bits that can be send by the relay is limited by the 
number of bits in its buffer and the instantaneous capacity of the 1Z-V link. The number of bits remaining 
in the buffer at the end of time slot i is given by 

Q(i) = Q(i-l)-RRv(i), (4) 

which is always non-negative because of ©. 

Because of the half-duplex constraint, we have Rnv{i) = when the source transmits (and the relay 
listens), and we have Ssn(i) = when the relay transmits. 

C. Throughput 

Since we assume the source has always data to transmit, the average (normalized) number of bits that 
arrive at the destination per time slot is given by 

1 N 

r = i im afX)-^^)' (5) 

N^oo iv z — ' 
i=\ 

i.e., r is the throughput of the considered communication system. The goal of the following sections is 
the maximization of r by optimizing the adaptive link selection protocol and the transmit power allocated 
to source and relay. 

D. Conventional Relaying 

For comparison purpose, we provide the throughput of two baseline schemes. Thereby, we assume that 
the transmit powers at the source and the relay are fixed, i.e., Ps(i) = Ps, Pn(i) — Pr> Vi. 

1 ) Conventional relaying without buffer: The instantaneous throughput of conventional relaying without 
buffer, where the relay receives a packet in one time slot and transmits it in the next, is given in [5], and 
the corresponding average throughput is 

N/2 



Tconv,i = Jim min{log 2 (l + s(2i - 1)), log 2 (l + r(2i))} 



i=l 

= if7{min{bg 2 (l + s(z)),log 2 (l + r(z))}}, (6) 

where the ergodicity of s(i) and r(i) was exploited. Note that conventional relaying without buffer 
introduces a delay of one time slot since the relay has to wait until the entire codeword is received 
and decoded before sending the codeword to the destination. 
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2) Conventional relaying with buffer [13]: In conventional relaying with buffer as proposed in [13], 
the relay receives data from the source in the first N/2 (N is even) time slots and sends this cumulative 
information to the destination in the next N/2 slots. The corresponding maximum achievable average 
throughput is obtained for N — > oo and given by 



7conv,2 

N 



1 (N/2 N } 

lim — min I Vlog 2 (l + s(z)), V log 2 (l + r(i)) > 

\T— >oo iV — — 

^ i=l i=N/2 ) 



= ~mm{E{\og 2 (l + s(i))},E{\og 2 (l + r(i))}}. (7) 

Comparing © and O, we observe that r conv 2 > r conv ^ holds [13]. However, to realize this performance 
gain, the relay has to be equipped with a buffer of infinite size and an infinite delay is introduced. 

3) Rayleigh fading: For the numerical results shown in Section |VIJ we consider the case where the 
S-1Z and 1Z-V links are both Rayleigh faded, i.e., the probability density functions (pdfs) of s(i) and r(i) 
are given by f s (s) = e~ s ^ s /fig and f r (r) = e~ r ^ nR /Q,R, respectively. In this case, r collVi i and r con v,2 can 
be obtained in closed form as 

and 

t — = min { exp (£) Ei (£) • exp ik) Ei ik) } ■ <9) 

respectively, where Ei(x) = e~ l jt dt, x > 0, denotes the exponential integral function. 

III. Adaptive Link Selection 

To gain some insight, we assume throughout this section constant source and relay powers, i.e., Pg(i) = 
Ps, Pr(i) = Pr, Vi, and a buffer of unlimited size at the relay. For this case, we derive the optimal link 
selection policy and the corresponding throughput. Optimal power allocation and the effect of a limited 
buffer size will be discussed in Sections [IV] and [V] respectively. 

A. Problem Formulation 

Let di E {0, 1} denote a binary decision variable. We set di — 1 if the 1Z-V channel is selected for 
transmission in time slot i, i.e., the relay transmits and the destination receives. Similarly, we set di = if 
the S-1Z channel is selected for transmission in time slot i, i.e., the source transmits and the relay receives. 
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Exploiting dj, the number of bits send from the source to the relay and from the relay to the destination 
in time slot i can be written in compact form as 

Ssnii) = (1 - di)S(i) (10) 

and 

R KD (i) = d i isnn{R(i),Q(i-l)}, (11) 

respectively, where S(i) = log 2 (l + s(i)) and R(i) = log 2 (l + r(i)). Consequently, the throughput in © 
can be rewritten as 

1 - 

T = lim T7 5^^ min W),Q(^-l)}- (12) 

TV— >oo iV — * 

8=1 

The considered problem can now be stated as follows: Find the optimal link selection policy, i.e., the 
optimal sequence di, % > 1, which maximizes throughput r. 

Remark 1: Our problem formulation is quite general in the sense that we have introduced no restrictions 
concerning the required knowledge regarding the channel and the queue states. In other words, the optimal 
decision at time j, dj, potentially depends on h s (i), h R (i), and Q(i), i > 1, and thus requires non-causal 
channel knowledge. Fortunately, as will be shown in the ensuing section, this is not the case and the 
optimal policy turns out to be rather simple and easy to implement. 

B. Optimal Link Selection Policy 

Let us first define the average arrival rate of bits per slot into the queue of the buffer, A, and the average 
departure rate of bits per slot out of the queue of the buffer, D, as [16] 



1 N 

A -^ m vD 1 -*)^) (13) 

TV— >oo iV ^ — J 



t=l 

and 



D = lim ■iy;d i miii{i*(i),Q(z-l)}, (14) 

8=1 

respectively. We note that the average departure rate of the queue is equal to the throughput. The queue is 
said to be absorbing if A > D = r. The following theorem characterizes the optimal link selection policy 
in terms of the state of the queue in the buffer of the relay. 
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Theorem 1: A necessary condition for the optimal link selection policy, which maximizes the through- 
put, is that the queue in the buffer of the relay is at the edge of non-absorbtion, i.e., the queue is non- 
absorbing but is at the boundary of a non-absorbing and an absorbing queue. 

Proof: Please refer to Appendix [Al ■ 
Exploiting Theorem [H we can establish a useful condition that the optimal link selection policy has to 
fulfill and a simplified expression for the throughput. This is the subject of the following theorem. 

Theorem 2: Assuming non-negative, ergodic, and stationary random processes s(i) and r(i), for the 
optimal link selection policy the identity 

E{(l-d i )S(i)} = E{d i R(i)} (15) 

holds and the throughput is then given by 

r = E{diR(i)}. (16) 
Proof: Please refer to Appendix El ■ 

Remark 2: According to Theorem [2l for the optimal link selection policy, the queue is non-absorbing 
but is almost always filled to such a level that the number of bits in the queue exceed the number of 
bits that can be transmitted over the TZ-V channel. In particular, as shown in Appendix HH condition (IT3T ) 
automatically ensures that for N — > oo, ^^2^ =1 diR(i) = 4 J2f=i di ram{R(i), Q{i — 1)} is valid, i.e., 
the impact of event R(i) > Q(i — 1), i = 1, . . . , N, is negligible. 

We are now ready to derive the optimal link selection policy for buffer- aided relaying. According to 
Theorem [2l the policy that maximizes the throughput r in (fT6l) can only be found inside the set of policies 
that produce a queue which satisfies (fT5l) . and not outside of this set of policies. Thus, for N — > oo, we 
formulate the following optimization problem: 

Maximize : 4 J2f=i diR{i) 

di 

Subject to: Cl:£££i(l- W) = ^£tiW) (17) 
02:^(1-^=0 Vz 

where constraint CI ensures that the search for the optimal policy is conducted only among those policies 
that satisfy (fT5l) and C2 ensures that di e {0, 1}. We note that CI and C2 do not exclude the case that 
the relay is chosen for transmission if R{i) > Q(i — 1). However, according to Remark [2l CI ensures that 
the influence of event R(i) > Q(i — 1) is negligible. Therefore, an additional constraint dealing with this 
event is not required. The solution of problem (fT7l) leads to the following theorem. 



Theorem 3: The optimal policy maximizing the throughput of buffer-aided relaying with adaptive link 
selection is given by 

/ 1 if .F(r(i)) > pF{s{i)) 
di= < (18) 
otherwise 

where p is referred to as the decision threshold and the optimal decision function is given by 

F{x) =log 2 (l + z). (19) 

The optimal decision threshold p, p opt , has to satisfy (TT5T) . 

Proof: Please refer to Appendix O ■ 

Remark 3: Interestingly, we observe from Theorem [3] that the optimal decision, di, at time slot i, 
depends only on the instantaneous SNRs, s(i) and r(i), of that time slot. d{ does not depend on the state 
of the queue, Q(i), in any time slot nor on the instantaneous SNRs in previous or future time slots. This 
makes the proposed optimal selection policy highly practical. We note that the optimal decision threshold, 
p opt , depends on the statistical CSI of both involved links as will be established in the next section. The 
independence of the optimal link selection policy from non-causal instantaneous CSI is caused by the 
relay being operated at the edge of non-absorbtion, i.e., the relay node is practically fully backlogged. 
Non-causal knowledge would only help buffer management (i.e., ensuring that there is a sufficient number 
of bits in the buffer for upcoming time slots), which is not required in the considered regime. 

Remark 4: In this paper, we assume that the transmitting nodes have perfect CSI and apply adaptive 
rate transmission. However, we note that this is not necessary for achieving the maximum throughput in 
(fT6l) . In fact, the proposed adaptive link selection protocol (fT8~l) also achieves the maximum throughput 
in (PT6l) if source and relay transmit long codewords that span (ideally infinitely) many time slots (and 
consequently many channel states). In this case, both the source and the relay can transmit with constant 
rate r = E{{\ — di)S(i)} = E{diR(i)} and rate adaptation is not necessary. The first codeword is 
transmitted by the source without link adaptation and decoded by the relay. For all subsequent codewords, 
link adaptation is performed based on (TTBl and source and relay transmit parts of a long codeword whenever 
they are selected for transmission. The disadvantage of this approach is that the long codewords inherently 
introduce (ideally infinitely) long delays and the generalization of this approach to the delay constrained 
case is difficult. Therefore, in this paper, we consider adaptive rate transmission and assume that one 
codeword spans only one time slot (and consequently one channel state). 
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C. Generalization of the Decision Function and Optimal Decision Threshold 

The optimal decision function in (fT9l) is intuitively pleasing since it basically means that link selection 
is performed based on the instantaneous link capacities. Nevertheless, for some fading distributions, the 
optimal decision function may lead to complicated expressions for the throughput and the optimal decision 
threshold p opt . In such cases, simpler suboptimal decision functions such as J-(x) = x may be preferable 
as they generally lead to a similar performance as F{x) = log 2 (l + x) but are analytically more tractable. 
Thus, in the following, we generalize the decision function, jR(x), to be any non-negative, smooth, and 
increasing function, i.e., T(x + e) > F{x) for e > 0. We assume that the inverse of exists and denote 
it by 

We note that Theorems \T\ and [2] also hold for suboptimal decision functions J-'(x). For a given JF{x), 
the corresponding decision threshold p has to satisfy (fT3T) . Thus, the optimal decision threshold, p opt , for 
a given (optimal or suboptimal) decision function, JF{x), can be computed based on the following lemma. 

Lemma 1: Denote the pdfs of s(i) and r(i) by f s (s) and f r (r), respectively. For a given decision 
function, J-'(x), the optimal p opt is the solution of 



log 2 (l + s)f s (s)ds 

UG(r) 



f r (r)dr 



log 2 (l + r)f r {r)dr 

UH(s) 



f s (s)ds, (20) 



where the integral limits are given by G(r) = J r_1 (J r (r)/p) and H(s) = J r ^ 1 (pJ r (s)). 

Proof: The left hand side of (fT5l) is the expectation of variable (1 — d f ) log 2 (l + s(i)). This variable 
is nonzero only when dj = 0. From (fT8l) we observe that d { = if pj z {s{i)) > J r (r(i)), which is 
equivalent to s(i) > J r_1 (J r (r(z))/p). Therefore, the domain of integration for calculating the expectation 
of (1 — dj)log 2 (l + s(i)) is s(i) > J r_1 (J r (r(i))/p) and r(i) > 0, which leads to the left hand side of 
(l20l) . Using a similar approach, the right hand side of (l20l) is obtained from the right hand side of (fT5l) . 
This concludes the proof. ■ 
Remark 5: Eq. (1201) reveals that the optimal decision threshold, p opt , depends indeed on the statistical 
properties of both involved links as was already alluded to in Remark [3] 



D. Rayleigh Fading 

For concreteness, we provide in this subsection expressions for p opt and the corresponding maximum 
throughput r max for Rayleigh fading links. Thereby, the optimal and a suboptimal decision function 
are considered. 
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1 ) F{x) = x: In this case, the limits G(r) and H(s) in (l20l) are given by G(r) — r/p and H(s) = ps. 
Thus, after some elementary manipulations, (1201) simplifies to 



1 



4 



exp — £x — - exp — ^ — 



pfi< 



exp 



Oj? + ptt s 
ptisan 



1 



+ pn s 



exp 



Ex 



tt R + pQ s 



0. 



(21) 



pV-S^R 

The optimal value of p for T{x) = x, p op t,i, can be obtained from (|2TT) via a simple one dimensional 
search. 

The corresponding maximal throughput can be obtained from r maXi i = E{di log 2 (l +r(i))} and is given 

by 



1 



7"max, 1 



pQ< 



n R j in(2)n R + P n s 



exp 



Cl R + pO< 

pf2sf2 



Ex 



Vt R + p^< 

pf2sf2 



R 



(22) 



ln(2) 
where p = p opM . 

2J J"(x) = log 2 (l + x): In this case, we have G(r) = (1 + r)p - 1 and if(s) = (1 + s) p - 1 in (1201) . 
Thus, after some manipulations, we obtain from (|20l ) 



exp 



(r + l)p 



exp 



(s + iy- i 



In (V + 1> 



ln((s + + e n fl^i 



(r + l)p 
(a + l) p 



*Lr 



x -p-exp 

"5 



r 
s 



cfe = 0. (23) 



The optimal p, p op t,2, can be found numerically from (1231) . The corresponding maximum throughput is 
obtained as 



' max, 2 



exp 



n 



R 



i f(s + l) p 
x hx{{s + l) p ) + e^Ex 



n 



R 



— exp ( --— ) ds, (24) 



ln(2) J 
where p = p opt)2 . 

5) Special case (Qs = &>r)-' For the special case f2g — fi/j — f2, we obtain from ((2"T|) and (123 
Po P t,i = Po P t,2 = 1, and the corresponding maximal throughput is 



7"max 7"max,l ^max,2 



ln(2) 



n 



n 



21n(2) 



n 



exp - ^ - . 



n 



(25) 



Comparing this throughput with the throughput achievable with conventional buffer-aided relaying without 
adaptive link selection, cf. ©, the gain of adaptive link selection can be characterized by 



Tmax / T( 



conv,2 



n 



2-exp -)Ex - )/ 



n 



n 



n 



> i, 



(26) 
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where the ratio r max /r conVj2 monotonically increases from 1 to 1.5 as D, decreases from oo to zero. We 
note that the results in Section IVT1 reveal that the gain of adaptive link selection is minimum for fig = Qr, 
cf. Fig. [21 

Remark 6: From (fl"8l) it is easy to see that for p = 1 the decision functions J-'(x) = log 2 (l + x) and 
J-'(x) = x are equivalent in the sense that they lead to the same decisions. Hence, both decision functions 
lead to identical throughputs r max = r maXi i = r maXi2 . However, for Q s Popt.i ^ Po P t,2 ^ 1 holds, 

and the decision functions are no longer equivalent and r max 2 > T max x holds. 

IV. Power Allocation 

So far, we have assumed that the source and relay transmit powers are fixed. In this section, we jointly 
optimize the power allocation and link selection policies for buffer- aided relaying. 

A. Problem Formulation and Optimal Power Allocation 

Our goal is to jointly optimize the link selection variable di and the powers Ps(i) and Pr(i) in each 
time slot i such that the throughput is maximized. For convenience, we optimize in the following the 
transmit SNRs without fading 7g(i) and 7_r(z), which may be viewed as normalized powers, instead of the 
powers Ps(i) = ls{i)&n s and Pr{i) = 7i?(«)o"n fl themselves. For a fair comparison, we limit the average 
(normalized) power consumed by the source and the relay to Y. This leads for N — V oo to the following 
optimization problem: 

Maximize : ^ Y*=i ^ log 2 (l + TflWtaW) 

Subject to: CI: ^ EiliC 1 ~ d i) lo &i{ 1 + T^W^W) = Eii di^i 1 + lR{i)h R {i)) ^ 
C2 : -di) = 

C3 : i Ef=i(l - *)7s(0 + jf Eli dilnii) < T 
where constraints CI and C2 are identical to the constraints in (flTl) and C3 is the joint source-relay power 
constraint. The solution of Problem (T27T) is summarized in the following theorem. 

Theorem 4: The optimal (normalized) powers 75 (i) and 7#(i) and decision variable di maximizing the 
throughput of buffer-aided relaying with adaptive link selection while satisfying an average source-relay 
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power constraint are given by 



7sW 



7i?W 



p/A - l/h s (i) if h s {i) > X/p 

otherwise 

1/A - l/h R (i) if h R (i) > X 

otherwise 



(28) 



(29) 



1 if 



In ' 
OR 



A 



+ 4iy- 1 >pWfM*)) + 



h S (i) 



p AND ^(i) > A AND h s (i) > 



h R (i) > X AND h s (i) < J 



(30) 



otherwise 

where the optimal p, p opt , and the optimal A, A opt , have to satisfy CI and C3 in (1271) for N — > oo with 
equality. 

Proof: Please refer to Appendix [Ql ■ 
Remark 7: Similar to the case without power allocation discussed in Section [Till it is also possible 
for the case with power allocation to simplify the link selection policy in (l30l) . For example, a simple 
suboptimal link selection policy which only depends on the instantaneous channel gains h s (i) and h R (i) 
but is independent of the transmit powers js{i) and 7^(i) may be adopted at the expense of some loss in 
performance. However, in this paper, we do not pursue suboptimal link selection policies for the case of 
power allocation because of space limitation. 



B. Finding the Optimal X and p 

The following lemma establishes two equations from which the optimal A and p can be found. 

Lemma 2: Denote the pdfs of hs(i) and h R {i) by fh s (hs) and fh R {h R ), respectively. Let the transmit 
powers of the source and the relay in time slot i be given by (T28T) and (T29l) . respectively, and the link 
selection variable d { by (l30l) . Then, the optimal p, p opt , and the optimal A, A opt , maximizing the throughput 
of buffer-aided relaying with adaptive link selection and power allocation have to fulfill the following two 



equations 



Ux/p 

X/p 



( ) 



fh R (h R )dh R + 



phc 



L 

J J lo §2 (jjf) ^« (h R )dh R fh s (h s )dh s + j ^ j log 2 (^f^j fh R (h R )dh R 



X 



fh s (h s )dh s 



fh R (h R )dh R 



fh s (h s )dh s , 
(31) 
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a r poo 



X/p 



P-1 

A h f 



fh s {hs)dh s 



fh R {h R )dh R 



o 



+ 

= r 

where 



'1 1 

A h R 



fh R {h R )dh R 



fh s {h s )dh s 



X/p 



p 


1 


A 




oo 




2 


a- 



fh s {h s )dh s 



fh R (h R )dh 



fh R {h R )dh 



R 



A 



A 



fh s (h s )dh s 
(32) 

(33) 



^(_ e (/ lH -A)/(«-i(A// iB )Vp)' z ty(-e^ 1 - A ^s(A/(p/ l5 ))p)- 
Here, W(-) is the Lambert ^/-function [17], which is available as built-in function in software packages 
such as Mathematica. The maximum throughput is given by the left (and right) hand side of (mi) . 

Proof: Please refer to Appendix [0 ■ 
To find A op t and p op t meeting (I3TT) and (1321) . a two dimensional search over all A > and p > has to 
be conducted or built-in root-finding functions of commercially available software such as Mathematica 
can be used. The optimal A and p can be found offline since (13D) and (1321) only depend on the statistical 
properties of the S-1Z and the 1Z-D links. Since these statistical properties change on a much slower time 
scale than the instantaneous channel gains, A opt and p opt can be updated with a low rate. 



C. Rayleigh Fading 

For the special case of Rayleigh fading with f hs (h s ) = e- hs/ns /Cl s and f h . R (h R ) = e~ hR/ClR /Vl R , <E 
can be simplified to 

'pL 



ln(2) 



1 - e' x/nR ) E x 



ln(2) L 

and (|32|) can be simplified to 



A 
ptt< 



1 - e- A ^) )Exl±- 
Q R 



X/p 



A 



n 



R 



,(34) 



1-e 



A 



He 




dh s = T, (35) 



where L\ and L 2 are given in (1331) and the maximum throughput is given by the left (and right) hand side 
of equation (l34l) . 
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V. Delay Limited Transmission 



So far, we have assumed that there is no delay constraint and that the size of the buffer at the relay is 
infinite. In practice, there is usually some constraint on the delay and on the buffer size. In this section, we 
investigate how these constraints affect the performance of the proposed relaying protocol. For simplicity, 
we assume fixed transmit powers, i.e., Pg(i) = Ps, -Pr(«) = Pr, Vi, and consequently, policy (fT8l) is used 
for link selection. 

Since we assume that the source is backlogged and has always information to transmit, for the considered 
three-node network, the transmission delay is caused only by the buffer at the relay. Let T(i) denote the 
delay of a bit that is transmitted by the source in time slot i and received at the destination in time slot 
i + T(i), i.e., the considered bit is stored for T{i) time slots in the buffer. Then, according to Little's law 
[18] the average delay E{T(i)} (i.e., the average time that a bit is stored in the buffer) is given by 



where E{Q(i)} is the average queue length at the buffer and A is the average arrival rate into the queue. 
E{Q(i)} is given in bits and A is given in bits/slot. Thus, the average delay E{T(i)} is given in time 
slots. From (l36l) . we observe that the delay can be controlled via the arrival rate and the queue size. In 
the following, we will present two different approaches to adjust the arrival rate and the queue size. The 
first approach is to "starve" the buffer, i.e., we intentionally limit the arrival rate by choosing p < p opt . 
The second approach is to limit the buffer size by forcing the relay to transmit if the buffer gets full. 

A. Satisfying an Average Delay Constraint by "Starving " the Buffer 

Starving the buffer is a common approach for limiting average delays in queueing systems [19], [20]. In 
our case, we can decrease the average arrival rate by selecting p < p opt which leads to E{(1 — di)S(i)} < 
E{diR(i)}. In the following theorem, we establish an upper bound for the resulting average delay. 

Theorem 5: Let p < p opt in ([TBI such that £ = E{(1 — di)S(i)} / 'E{diR(i)} < 1. In this case, assuming 
slot-by-slot uncorrelated fading the average delay in slots is bounded by 



E{T{i)} = E{Q(i)}/A, 



(36) 




E{(1 - dJS 2 ®} + £(2 - £)£{<U2 2 (i)} 



(37) 



E{diR(i)}-E{(l-di)S(i)} 



and the throughput is given by r = E{(1 — di)S(i)}. 
Proof: Please refer to Appendix [0 
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Remark 8: Exploiting (1371) the required value of p < p opt to ensure a desired average delay can be found. 
For example, assuming F(x) = x and Rayleigh distributed S-1Z and 1Z-V channel gains the expected 
values required in (1371) can be obtained as 



E{(l-di)S(i)} 



E{diR{i)} 



ln(2) 



exp 



1 



ln(2) L 



n 



E{diR 2 {i)} 



exp — )E 1 — 



1 



1 



exp 



VL R + pO< 



exp 



tt R + p^< 
pf2sf2 



R 



E x 



+ ptt s 

ptls^R 



(log 2 (l + S )) 5 



_Jr/p 



-c/s 



-<ir 



(lot 



r)) 



R 



-s/a s 



-<is. 



(38) 
(39) 
(40) 
(41) 



Inserting now (f38T)-(i4Tb into (T37T) . the value of p guaranteeing a certain average delay can be found by 
slowly increasing p from zero until the right hand side of (1371) equals the desired average delay E{T(i)}. 

If the buffer size is limited, there is a non-zero probability that the bits arriving into the buffer have to 
be dropped because the buffer is full, even if the buffer is starved. However, the probability of this event 
happening can be minimized by properly choosing the buffer size compared to the desired average delay. 
This issue is addressed in the following lemma. 

Lemma 3: Denote the maximum queue size by Q max . Then, we can bound the probability that the 
queue in the buffer exceeds Q mSLX as 

Pr{Q(z) > Q max } < E{Q(i)}/Q max . (42) 
Proof: The proof follows directly from Markov's inequality. ■ 
We can guarantee any prescribed probability of dropped bits, Pr{Q(i) > Q max }, by selecting appropriate 
values for p and Qmax based on (|42)) and (l64l) given in Appendix [0 The resulting throughput is given by 

r = E{(1 - di)S(t)\Q(i) < Q max }Pr{Q(i) < Q max } = E{(1 - di)S{i)}Pi{Q(i) < Q max } 

= E{(1 - di)S(i)}(l - Pr{Q(z) > Q max }), (43) 

for which a lower bound can be found by combining (l42l) . (l43l) . and (|64|) . 



B. Satisfying the Delay Constraint by Limiting the Queue Size 

For the scheme proposed in the previous subsection dropped bits are unavoidable. In this subsection, 
we propose an alternative approach which allows us to avoid dropped bits. Let the buffer size again be 
limited to Qmax bits. The proposed scheme employs the following link selection protocol: 
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1) If Qmax - Q(i - 1) > S(i), select di based on (fT8l) . 

2) Otherwise, set dj = 1. 

Hence, if there is enough room in the buffer to accommodate the bits possibly sent from the source to the 
relay, the link selection protocol introduced in Section [III] is employed. On the other hand, if there exists 
the possibility of a buffer overflow, the relay transmits to reduce the amount of data in the buffer. 

Remark 9: Although conceptually simple, a theoretical analysis of the throughput of the queue size 
limiting protocol is difficult. In contrast to the buffer starving protocol discussed in Section IV-Al for the 
queue size limiting protocol, the average arrival rate A, depends on the frequency with which the buffer 
has to be emptied due to a full queue. The frequency of these events depends in turn on the average 
arrival rate. This mutual dependence of average arrival rate and emptying the buffer makes a meaningful 
theoretical analysis difficult. Thus, we will resort to simulations to evaluate the performance of the queue 
size limiting protocol in Section [VH 

Remark 10: We note that both proposed protocols for the delay constrained case are heuristic in nature. 
The search for other protocols with possibly superior performance is an interesting topic for future work. 
The proposed protocols for the delay constrained and the delay unconstrained case can serve as benchmark 
and performance upper bound for these new protocols, respectively. 

VI. Numerical and Simulation Results 

In this section, we evaluate the performance of buffer-aided DF relaying with adaptive link selection 
and compare it with that of conventional relaying. Throughout this section, we assume Rayleigh fading. 
All results shown in this section have been confirmed by computer simulations. However, the simulations 
are not shown in all instances for clarity of presentation. 

A. Delay Unconstrained Transmission 

First, we assume that there are no delay constraints and investigate the achievable throughputs with and 
without power allocation. 

1 ) Throughput of Buffer-Aided Relaying with Adaptive Link Selection: We first consider the case of 
fixed transmit powers. In Fig. [2] we show the ratio of the optimal throughput of buffer-aided relaying with 
adaptive link selection, r max , and the throughput of conventional relaying with a buffer, r conVj2 , given in ©, 
as a function of CIr/CIs for several different values of fi^. The corresponding optimal decision thresholds, 
p op t, for buffer-aided relaying with adaptive link selection are shown in Fig. [3] For buffer-aided relaying 



18 



with adaptive link selection, we considered the decision functions JF{x) = x and F{x) = log 2 (l + x) 
and calculated the corresponding throughputs based on (1221) and d24l) . respectively. The optimal decision 
thresholds were obtained from (TfTj) and (1231) . respectively. Clearly, buffer-aided relaying with adaptive 
link selection leads to substantial throughput gains compared to conventional relaying. Both considered 
decision functions lead to very similar performances, although at very high ratios £l R /Q s the optimal 
decision function JF{x) = log 2 (l + x) yields a small throughput gain. The ratio r max /r conv 2 approaches 
two as £l R /n s — > and £l R /Q s — > oo. For Q R /n s — > 0, the source-relay link is selected very rarely for 
transmission (as p opt — > 0) since comparatively large amounts of data can be transferred to the relay in a 
single time slot. Thus, the relay can almost always transmit as compared to half of the time in conventional 
relaying. On the other hand, for £l R /Q s — > oo, it is the relay-destination channel that is used very rarely as 
p opt — > oo and the source can transmit almost all the time, which results in twice the throughput compared 
to conventional relaying. 

2) Throughput with Power Allocation: In Figs. 0] and [51 we investigate the gains achievable with power 
allocation (PA) for a system with 0,$ = 0.1 and Cl R = 1.9. Thereby, we compare the performances of 
buffer-aided relaying with adaptive link selection and conventional relaying with and without a buffer. For 
buffer-aided relaying with adaptive link selection and power allocation the throughput, power allocation, 
and link selection policy were obtained as described in Theorem [4] and Lemma [2] in Section [IV] For 
conventional relaying with buffer, a similar optimal power allocation scheme as for buffer-aided relaying 
with adaptive link selection was adopted with variable powers ^(i) = max{0, 1/a — l/hj(i)}, j G {S, R}, 
where a is chosen such that E{^j{i)} = T. For comparison, in Figs.@]and[5l we also show the performance 
of both considered relaying schemes without power allocation, i.e., we set js{i) = 7fl(i) = T. Furthermore, 
to highlight the gain compared to conventional relaying without buffer and without power allocation, in 
Fig.|H we normalized the throughput with respect to r conv j as given by d8]). Figs. |4] and [5] show that optimal 
power allocation can improve performance of both buffer-aided relaying with adaptive link selection and 
conventional relaying. For example, for T = dB buffer-aided relaying with adaptive link selection and 
power allocation leads to a throughput gain of 95 % compared to conventional relaying with buffer and 
power allocation. Nevertheless, the gain achievable by adaptive link selection is more significant than the 
gain from power allocation. For example, at T = 20 dB, adaptive link selection yields a throughput gain 
of 1 bit/slot compared to conventional relaying with buffer. 
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B. Delay Constrained Transmission 

We now turn our attention to delay limited transmission and investigate the performance of the two 
proposed protocols for this case. In the following we assume uncorrelated fading. Furthermore, we assume 
fixed transmit powers for the source and relay, and adopt the suboptimal decision function JF{x) = x. 

1) Starving the Buffer: In Fig. [6l we show the ratio of the throughput of buffer-aided relaying with 
starved buffer, r max , and the throughput of conventional relaying without buffer, r conV; i, as given in ®, 
as a function of the upper bound on the average delay. The corresponding decision thresholds, p, are 
shown in Fig. [7J For the theoretical results shown in Figs. [6] and [7J the throughput and the upper bound 
on the average delay for buffer-aided relaying with starved buffer were obtained based on Theorem [5] and 
Remark [U A comparison of the theoretical results with the simulation results also shown in Fig. [6] reveals 
that the derived upper bound on the delay is tight, especially for large delays. On the other hand, for very 
small delays, buffer-aided relaying with starved buffer becomes inefficient since the starving of the buffer 
decreases the average queue size and increases the probability that the relay is selected for transmission 
when R(i) > Q(i — 1). In fact, Fig. [6] shows that for very small delays buffer-aided relaying with starved 
buffer may be even outperformed by conventional relaying without buffer. As expected, the throughput 
of buffer-aided relaying increases with increasing tolerable delay and p approaches the optimal value for 
the delay constrained case, p opt (computed from (121]) ). for large delays. The required delay to achieve a 
throughput gain compared to conventional relaying increases with increasing CIs-£Ir ratio since for large 
£Is/Qr an d small tolerable average delays, the arrival rate into the buffer has to be severely limited (i.e, 
p has to be chosen very small) which has a negative impact on the throughput which, by the conservation 
of flow, is equal to the arrival rate. 

In Fig. [H we show the probability of a dropped bit as a function of the buffer size Q max for three 
different average delays and fig = VLr = 1. These results were obtained via simulations since the bound 
obtained in (|42|) is relatively loose due to the looseness of Markov's inequality. Fig. [8] shows that the 
probability of dropping a bit rapidly decreases with increasing buffer size and decreasing average delay. 

2) Limiting the Queue Size: In Fig. [9l we show the throughput achieved by limiting the queue size, 
Tiimit, and the throughput achieved by starving the buffer, r starvc , normalized by the throughput of con- 
ventional relaying without buffer, r conv>1 , as given in ([8]), for symmetric and asymmetric link qualities. 
For comparison, Fig. [9] also contains the throughput of delay unconstrained buffer-aided relaying with 
adaptive link selection, which constitutes an upper bound for the throughput in the delay constrained 
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case, and conventional relaying with buffer as proposed in [13]. For conventional relaying with buffer, 
the relay drops information bits if the achievable rate of the S-1Z link in the first N/2 time slots exceeds 
the achievable rate of the 1Z-V link in the second N/2 time slots. All results shown in Fig. [9] have been 
obtained by simulations. Fig. [9] reveals that the performance of both delay constrained buffer-aided relaying 
protocols is comparable for large delays and approaches that of the delay unconstrained protocol. For small 
delays, limiting the buffer size yields a higher throughput than starving the buffer. However, both proposed 
protocols may be outperformed by conventional relaying with and without buffer for very small delays as 
for the proposed simple protocols, the relay may be selected for transmission even if R(i) > Q(i — 1). 
While this event has negligible effect for delay unconstrained transmission since the optimal link selection 
policy ensures that the queue is sufficiently long such that R(i) > Q(i — 1) is avoided (cf. Remark |2]), 
this is no longer true for the delay constrained case. Therefore, for the delay constrained case, more 
sophisticated protocols should be developed that take into account that R(i) > Q(i — 1) may occur. 

VII. Conclusions 

In this paper, we proposed a novel adaptive link selection protocol for relays with buffers. In contrast 
to conventional relaying, where the source and the relay transmit according to a pre-defined schedule 
regardless of the channel state, in the proposed scheme, always the node with the stronger link is selected 
for transmission. For delay unconstrained transmission, we derived the optimal link selection policy for 
the cases of fixed and variable source and relay transmit powers. Remarkably, in both cases, the optimal 
policy for a given time slot only depends on the instantaneous CSI of that time slot and the statistical CSI 
of the involved links. This makes the optimal policies attractive for implementation. For delay constrained 
transmission, we proposed two different methods to control the delay introduced by the buffer at the 
relay. Furthermore, for the case when the buffer is starved, we derived upper bounds on the average delay 
and the number of dropped bits for limited buffer size. Our analytical and simulation results showed that 
buffer-aided relaying with adaptive link selection is a promising approach to significantly increase the 
throughput compared to conventional relay-assisted transmission. Interesting extensions of the presented 
work include using the considered simple three-node network as a building block for larger networks, 
studying the impact of imperfect CSI, and deriving the outage probability for fixed rate transmission. 
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Appendix 

A. Proof of Theorem [7J 

We first note that, because of the law of the conservation of flow, A > r is always valid and equality 
holds if and only if the queue is non- absorbing. Assume first we have a link selection policy with average 
arrival rate A and throughput r with A > r, i.e., the queue is absorbing. For this policy, we denote the 
set of indices with di = 1 by I and the set of indices with di = by /, i.e., for N — > oo we have 

A = lv E^ 1 - d ^ S & > T = E di min {^), Q(i ~ !)}• (44) 

i€l i£l 

From (1441) we observe that the considered protocol cannot be optimal as it can be improved by moving 
some of the indices i in I to I which leads to an increase of r at the expense of a decrease of A. 
However, once the point A = r is reached, moving more indices i from I to I will decrease both A and r 
because of the conservation of flow. Thus, a necessary condition for the optimal policy is that the queue 
is non-absorbing but the queue is at the edge of non-absorbtion, i.e., the queue is at the boundary of a 
non-absorbing and an absorbing queue. This completes the proof. 

B. Proof of Theorem 

We denote the sets of indices i for which di = 1 and d^ = holds by / and /, respectively, e denotes a 
subset of / and | ■ | is the cardinality of a set. Throughout the remainder of this proof N — > oo is assumed. 

If the queue in the buffer of the relay is absorbing, A > r holds and on average the number of bits 
arriving at the queue exceed the number of bits leaving the queue. Thus, R{i) < Q(i — 1) holds almost 
always and as a result the throughput can be written as 

r = ^ E min w), Q(i - 1)} = i E ^ 

Now, we assume that the queue is at the edge of non-absorption. That is A = r holds but moving a 
small fraction e, where |e|/iV — > 0, of indices from I to I will make the queue an absorbing queue with 
A > t. For this case, we wish to determine whether or not 

1 m > r = ^ E W), Q(i - 1)} = A = i J2 S(i) (46) 
iei iei 

holds. To test this, we move a small fraction e, where |e|/iV — > 0, of indices from I to I, thus making the 
queue an absorbing queue. As a result, (|45T) holds, and (l46l) becomes 

1 = r = ^ E nun{J2(t), Q(i - 1)} < A = 1 ^ 5(0- (47) 
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From the above we conclude that if (l45l) holds, then based on (|46|) and (1471) . for |e|/iV — >• 0, we must have 
and 



N ^ w TV 



£ E *M < \ E S «- («> 

iel\e ie/Ue 

However, for (|48l and (|49l ) to jointly hold, we require that the particular considered moving of indices from 
I to I has caused a discontinuity in ■h'^2 ie iR{i) or / an d a discontinuity in ^X)iei J ^'(*) as l e l/^ ~~ * 
is assumed. Since the capacities of the S-1Z and 1Z-V links are such that lim A r_ s . 0O S(i)/N — > and 
liniAr^oo R(i)/N — >■ 0, Vi, such discontinuities are not possible. Therefore, at the edge of non- absorption 
(l46l) is not true and we must have instead 

i £ J2(t) = r = 1 £ mm{R(i), Q(i - 1)} = A = 1 £ 5(<) (50) 

Using the the ergodicity of s(i) and r(i), (l50l) can be expressed as (fT5l) . and the throughput can be written 
as (TT6l) . This concludes the proof. 

C. Proof of Theorem \3\ 

The Lagrangian for Problem (fT71) is given by 

N N N 

j=l i=l i=l 

where // and /?, are Lagrange multipliers. Differentiating £ with respect to di and setting the result to zero 
leads to 

For — di) — to hold, we need either di = or di = 1, which leads to two possible values for 

d< = A,i = (1 - - ^(i) (53) 

d 4 = 1 ft, 2 = -Pi,i (54) 

For the maximum of £ in (15TI ), < 0, Vz, has to hold. Furthermore, < \i < 1 has to hold since for 
ji < and /i > 1 we have always dj = 1 and di = 0, respectively, irrespective of the values of R(i) and 
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S(i). Hence, we have 

/ 1 if (1 - fi)R(i) - [iS(i) > 
di = < (55) 

if (1 - fi)R(i) - fiS{i) < 

which is identical to (1T8~1) with JF{x) = log 2 (l +x) if we set p — fJ>/(l — if), p or equivalently p are chosen 

such that constraint CI of Problem (1171) is met. This completes the proof 

D. Proof of Theorem 



To solve Problem (1271) . we form the Lagrangian 

1 N 1 N 

£= ^"l2 d i ^Sal 1 + lR{i)h R (i)) - /u— [dj log 2 (l + ~/ R (i)h R (i)) - (I - di) log 2 (l + ^s{i)h s {i)) 

i=l i=l 
1 N 1 N 

~ U nY1 [t 1 - d ^s(i) + dij R {i)\ -j^Yl ~ (56) 

i=l i=l 

where the Lagrange multipliers /i, flu and ^ are chosen such that CI, C2, and C3 are satisfied, respectively. 
By differentiating C with respect to 7s (i), 7ij(i), and dj, and setting the results to zero, we obtain three 
equations. Solving this system of equations for 7s (i), 7i?(i), and di, and taking into account that < 0, 
< /i < 1, and f > 0, we obtain (|28l> . (|29b . and ([30]) after letting p = p/(l -p) and A = i/ln(2)/(l - p), 
which are chosen such that constraints CI and C3 are meet with equality. This completes the proof. 

E. Proof of Lemma [2] 

Since s(i) and r(i) are ergodic random processes, for N — > oo, the normalized sums in CI and C3 in 
(1271) can be replaced by expectations. Therefore, the left hand side of CI is the expectation of variable 
(1 — di) log 2 (l + 'j s (i)hs(i)). This variable is nonzero only when both (1 — dj) and 7s(«) are nonzero. 
The domain over which (1 — di) and 7s(«) are jointly nonzero can be obtained from (l28l) and (1301) and is 
given by 

(h s (i) > A/p AND h R (i) < A) OR (h s (i) > L x AND h R (i) > A) (57) 

where L\ is given by (l33l) . Variable (1 — di) log 2 (l + 7s(«)^5( z ')) nas t0 be integrated over domain (l57l) 
to obtain its average. This leads to the left side of OTI) . 

Similarly, the right hand side of CI is the expectation of the variable djlog 2 (l + ^R{i)h R {%)). This 
variable is nonzero only when both dj and 7n(i) are nonzero. The domain over which di and 7jj(«) are 
jointly nonzero can be obtained from (l29l) and d30l) and is given by 

{h R {i) > A AND h 8 {i) < A/p) OR {h R {%) > L 2 AND /i 5 (i) > A/p) (58) 
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where L 2 is given by (1331) . Variable di log 2 (l + 7r nas t0 t> e integrated over domain (1581) to obtain 

its average. This leads to the right side of (I3TT) . 

Following a similar procedure, we can obtain (|32l ) from C3 in (|271) . This completes the proof. 

F. Proof of Theorem \5\ 

For £ = — di)S'(i)}/E'{<ij.R(z)} < 1 the queue is non-absorbing, and thus, because of the law of 
conservation of flow, the throughput is equal to the arrival rate, i.e., r = E{(1 — di)S(i)}. 

To arrive at an upper bound for the average queue size, we first introduce two auxiliary results from 
the literature. Let 

q(i) = max{g(z — 1) — u(i), 0}, (59) 

where u(i) is a slot by slot uncorrelated random variable with E{u(i)} > 0. Also, let a(i) and b(i) be 
non-negative slot by slot uncorrelated random variables with E{b(i)} > E{a{i)} and set u(i) = b(i) — a(i). 
Then, equality [21] 

E{u 2 {i)} - 2E{u{i)}E{q{i)} = E{(max{u(i) - q(i), 0}) 2 } (60) 

and inequality [22] 

^ A E{a(i)}\ „ ri n. 



E{(max{u(i) - g(i), 0}) 2 } > ( 1 - ) E{b\t)} (61) 



hold. Furthermore, combining (1601) and (1611) . the following bound is obtained [22] 

E{q{t)} ^ 2 E{b(i)} - E{a(i)} ' (62) 

where £ = E{a(i)}/E{b(i)}. 

By rewriting the queue size as 

Q(i) = max {Q(i - 1) - ^(z) + (1 - d t )S(i), 0} , (63) 

we observe that (l63l) is in the form of (l59l) if we let q(i) = Q(i), a(i) = (1 — di)S(i), b(i) = diR(i) and 
£ = £'{(1 — dj)S'(z)}/£'{d i i?(i)}. Thus, assuming that s(i) and r(i) are slot by slot uncorrelated, we can 

exploit (|62|) and upper bound the average size of the queue as 

1 E{(1 - di)S 2 (i)} + E{diR 2 (i)} - 2E{(1 - di)diS(i)R(i)} - (1 - £) 2 E{diR 2 {i)} 



Em)} < 



2 EidiR^-Eiil-d^Sii)} 
1E{(1- di)S 2 (i)} + £(2 - £)E{diR 2 (i)} 



(64) 



2 E{rf iJ R(z)}-E{(l-d l )5(z)} 
Since the average arrival rate is given by A = E{(1 — di)S(i)}, we obtain (1371) from (|64|) and Little's law 

(l36l) . This completes the proof. 
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Fig. 1. System model comprising a source S, a half-duplex relay equipped with a buffer 1Z, and a destination 27. s(i) and r(i) are the 
instantaneous signal-to-noise ratios (SNRs) of the S-TZ and TZ-V links in the ith time slot, respectively. 




n R /fi s 

Fig. 2. Throughput ratio r max /T conv .2 vs. Qn/f2,g for buffer-aided relaying with adaptive link selection and fixed transmit powers for source 
and relay. 
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Fig. 3. Optimal decision threshold p op t vs. Hr/Qs for buffer-aided relaying with adaptive link selection and fixed transmit powers for 
source and relay. 
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r (in dB) 

Fig. 4. Throughput normalized to r CO nv,i vs. T for buffer-aided relaying with adaptive link selection and conventional relaying with buffer. 
The performance of both schemes with and without power allocation (PA) is shown. Cls = 0.1 and CIr — 1.9 
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Fig. 5. Throughput vs. F for buffer-aided relaying with adaptive link selection and conventional relaying with buffer. The performance of 
both schemes with and without power allocation (PA) is shown. Qs = 0.1 and CIr = 1.9 
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2.2 




Upper Bound on the Average Delay (in Slots) 



Fig. 6. Throughput ratio r max /T conv ,i vs. upper bound on the average delay for buffer-aided relaying with adaptive link selection and 
starved buffer. J-(x) = x. 
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Average Delay (in Slots) 



Fig. 7. Decision threshold p vs. upper bound on the average delay for buffer-aided relaying with adaptive link selection and starved buffer. 

^(aj) = x. 
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Max Queue Size, Q 



Fig. 8. Probability of a dropped bit vs. queue size Q max for buffer-aided relaying with adaptive link selection and starved buffer, fls = 

Qr = 1, J-(x) — x. 
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Average Delay (in slots) 

Fig. 9. Throughput ratio r m ax/T CO nv,i vs. average delay for the proposed buffer-aided relaying schemes with starved buffer and limited 
buffer and conventional relaying with buffer [13]. 



